Human tissue factor inhibitor

ABSTRACT

A cDNA clone having a base sequence for human tissue factor inhibitor (TFI) has been developed and characterized and the amino acid sequence of the TFI has been determined.

CROSS-REFERENCE TO RELATED APPLICATION

This is a Continuation of application Ser. No. 08/355,351 filed Dec. 13,1994, now abandoned which is a continuation of application Ser. No.08/093,285 filed Jul. 15, 1993, now U.S. Pat. No. 5,466,783, which is acontinuation of application Ser. No. 07/566,280 filed Aug. 13, 1990, nowabandoned, which is a divisional of application Ser. No. 07/123,753filed Nov. 23, 1987, now U.S. Pat. No. 4,966,852 which is acontinuation-in-part of application Ser. No. 07,077,366, filed Jul. 23,1987, now abandoned.

BACKGROUND OF THE INVENTION

This invention relates to a coagulation inhibitor known as tissue factorinhibitor (TFI) and alternatively as lipoprotein associated coagulationinhibitor (LACI). More particularly, the invention relates to a cDNAclone representing essentially the full size TFI.

The coagulation cascade that occurs in mammalian blood comprises twodistinct systems--the so-called intrinsic and extrinsic systems. Thelatter system is activated by exposure of blood to tissue thromboplastin(Factor III), hereinafter referred to as tissue factor (TF). Tissuefactor is a lipoprotein that arises in the plasma membrane of many celltypes and in which the brain and lung are particularly rich. Upon cominginto contact with TF, plasma Factor VII or its activated form, FactorVII_(a), forms a calcium-dependent complex with TF and thenproteolytically activates Factor X to Factor X_(a), and Factor IX toFactor IX_(a).

Early studies concerning the regulation of TF-initiated coagulationshowed that incubation of TF (in crude tissue thromboplastinpreparations) with serum inhibited its activity in vitro and preventedits lethal effect when it was infused into mice. Extensive studies byHjort, Scand. J. Clin. Lab. Invest. 9, Suppl. 27, 76-97 (1957),confirmed and extended previous work in the area, and led to theconclusion that an inhibitory moiety in serum recognized the FactorVII-TF complex. Consistent with this hypothesis are the facts that theinhibition of TF that occurs in plasma requires the presence of Ca²⁺(which is also necessary for the binding of Factor VII/VII_(a) to TF)and that inhibition can be prevented and/or reversed by chelation ofdivalent cations with EDTA. More recent investigations have shown thatnot only Factor VII_(a) but also catalytically active Factor X_(a) andan additional factor are required for the generation of TF inhibition inplasma or serum. See Broze and Miletich, Blood 69, 150-155 (1987), andSanders et al., Ibid., 66, 204-212 (1985). This additional factor,defined herein as tissue factor inhibitor (TFI), and alternatively aslipoprotein associated coagulation inhibitor (LACI), is present inbarium-absorbed plasma and appears to be associated with lipoproteins,since TFI functional activity segregates with the lipoprotein fractionthat floats when serum is centrifuged at a density of 1.21 g/cm³.According to Broze and Miletich, supra, and Proc. Natl. Acad. Sci. USA84, 1886-1890 (1987), HepG2 cells (a human hepatoma cell line) secretean inhibitory moiety with the same characteristics as the TFI present inplasma.

In copending application Ser. No. 77,366, filed Jul. 23, 1987, apurified tissue factor inhibitor (TFI) is disclosed which was secretedfrom HepG2 cells. It was found to exist in two forms, a TFI₁, migratingat about 37-40,000 daltons and a TFI₂ at about 25-26,000 daltons, asdetermined by sodium dodecylsulfate polyacrylamide gel electrophores is(SDS-PAGE). A partial N-terminal amino acid sequence for the TFI wasassigned as: ##STR1## wherein X--X had not been determined. Thedisclosure of said application is incorporated herein by reference.

BRIEF DESCRIPTION OF THE INVENTION

In accordance with the present invention, the complete coding sequenceof a cDNA clone representing essentially the full size tissue factorinhibitor (TFI) has been developed.

Initially, human placental and fetal liver λgt11 cDNA libraries werescreened with a rabbit polyclonal antiserum raised against a purifiedTFI. Immunologically positive clones were further screened for ¹²⁵I-Factor X_(a) binding activity. Seven clones were obtained which wereimmunologically and functionally active. The longest clone,placental-derived λP9, was 1.4 kilobases (kb) long while the other sixwere 1.0 kb in length. Partial DNA sequencing showed the 1.0 kb clonesto have sequences identical to part of the longer 1.4 kb clone.Nucleotide sequence analysis showed that λP9 consisted of a 1432basepair (bp) cDNA insert that includes a 5'-noncoding region of 133 bp,an open reading frame of 912 bp, a stop codon, and a 3'-noncoding regionof 384 bp.

The cDNA sequence encodes a 31,950 Dalton protein of 276 amino acidswhich includes 18 cysteines and 7 methionines. The translated amino acidsequence shows that a signal peptide of about 28 amino acids precedesthe mature TFI protein. It will be understood that the "mature" TFI isdefined to include both TFI and methionyl TFI by virtue of the ATGtranslational codon in the λP9 clone described herein.

There are three potential N-linked glycosylation sites in the TFIprotein with the sequence Asn-X-Ser/Thr, wherein X can be any of thecommon 20 amino acids. These sites are at amino acid positions Asn 145,Asn 195, and Asn 256, when the first methionine after the 5'-noncodingregion is assigned amino acid position +1.

The translated amino acid sequence of TFI shows several discernibledomains, including a highly negatively charged N-terminal, a highlypositively charged carboxy-terminal, and an intervening portionconsisting of 3 homologous domains with sequences typical of Kunitz-typeenzyme inhibitors. Based on a homology study, TFI appears to be a memberof the basic protease inhibitor gene superfamily.

The original source of the protein material for developing the cDNAclone λP9 was human placental tissue. Such tissue is widely availableafter delivery by conventional surgical procedures. The λgt11 (lac5 nin5c1857 S100) used herein is a well-known and commonly available lambdaphage expression vector. Its construction and restriction endonucleasemap is described by Young and Davis, Proc. Natl. Acad. Sci. USA 80,1194-1198 (1983).

Northern blot analysis showed that the following liver-derived celllines: Chang liver, HepG2 hepatoma, and SK-HEP-1 hepatoma, all contained2 major species of mRNA (1.4 and 4.4 kb) which hybridized with the TFIcDNA.

The cloning of the cDNA for TFI and development of its entire proteinsequence and structural domains as disclosed herein permits detailedstructure-functional analyses and provides a foundation for study of itsbiosynthetic regulations. The invention thus is important to medicalscience in the study of the coagulation cascade with respect to agentswhich are able to inhibit Factor X_(a) and Factor VII_(a) /TF enzymaticcomplex.

DETAILED DESCRIPTION OF THE INVENTION

While the specification concludes with claims particularly pointing outand distinctly claiming the subject matter regarded as forming thepresent invention, it is believed that the invention will be betterunderstood from the following detailed description of preferredembodiments of the invention taken in conjunction with the appendeddrawings, in which:

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 shows the screening of λgt11 clones with ¹²⁵ I-Factor X_(a).Cloned phage lysates (0.1 ml) were spotted on a nitrocellulose paper bysuction using a dot blot apparatus. The nitrocellulose paper was thenprobed with ¹²⁵ I-Factor X_(a) and autoradiographed as describedhereinafter. The clones that appear as dark spots are positive clonesthat bind 125I-Factor X_(a). Control λgt11 (lower right corner) andother clones do not bind ¹²⁵ I-Factor X_(a).

FIG. 2 shows a partial restriction map and sequencing strategy for theλP9 inserts. The scale at the bottom indicates the nucleotide position.The thick bar represents the coding region. The thin bars represent 5'-and 3'-noncoding regions. The restriction endonuclease sites wereconfirmed by digestion. The arrows show the overlapping M13 clones usedto sequence the cDNA.

FIG. 3 shows the nucleotide sequence and translated amino acid sequenceof the human TFI cDNA. Nucleotides are numbered on the left and aminoacids on the right. The underlined sequences have been independentlyconfirmed by amino acid sequence analysis of the purified TFI proteinand two V₈ protease+trypsin digested peptides. Amino acid+1 was assignedto the first methionine after a stop codon of the 5'-noncoding region.Potential N-lined glycosylation sites are marked by asterisks.

FIG. 4 is a graphical representation which shows the charge distributionof the amino acid sequence in TFI. Charges are calculated from the firstresidue to the i-th residues and displayed at the i-th residue. Thus thevalue of the i-th position is the summation of all charges from thefirst residue to the i-th residue and the difference of the chargesbetween the i-th and j-th residue (j>i) is the net charge of thefragment from i-th to j-th residue.

FIG. 5 is a graphical representation which shows the hydrophobicityprofile of TFI. The hydrophobicity profile was analyzed by a computerprogram whereby the hydrophobicity index of the amino acid residues isdefined as the depth to which an amino acid residue is buried inside aprotein (from X-ray crystallographic data) Kidera et al., J. ProteinChem. 4, 23-55 (1985)!. The hydrophobicity profile along the sequencewas smoothed using the program ICSSCU in IMSL Library IMSL LibraryReference Manual, 9th ed., Institute for Mathematical and StatisticalSubroutine Library, Houston, Tex. (1982)!.

FIG. 6 shows an alignment of the basic protease inhibitor domains of TFIwith other basic protease inhibitors. All the sequences except TFI wereobtained from the National Biomedical Research Foundation ProteinSequence Database (Georgetown University, Washington, D.C., release 13,June 1987). 1. Bovine basic protease inhibitor precursor; 2. Bovinecolostrum trypsin inhibitor; 3. Bovine serum basic protease inhibitor;4. Edible snail isoinhibitor K; 5. Red sea turtle basic proteaseinhibitor (only amino acids 1-79 presented); 6. Western sand viper venombasic protease inhibitor I; 7. Ringhals venom basic protease inhibitorII; 8. Cape cobra venom basic protease inhibitor II; 9. Russell's vipervenom basic protease inhibitor II; 10. Sand viper venom basic proteaseinhibitor III; 11. Eastern green mamba venom basic protease inhibitor Ihomolog; 12. Black mamba venom basic protease inhibitor B; 13. Blackmamba venom basic protease inhibitor E; 14. Black mamba venom basicprotease inhibitor I; 15. Black mamba venom basic protease inhibitor K;16. β-1-Bungarotoxin B chain (minor); 17. β-1-Bungarotoxin B chain(major); 18. β-2-Bungarotoxin B chain; 19. Horse inter-α-trypsininhibitor amino acids 1-57(1); 58-123 (2)!; 20. Pig inter-α-trypsininhibitor amino acids 1-57(1); 58-123(2)!; 21. Bovine inter-α-trypsininhibitor amino acids 1-57(1); 58-123(2)!; 22. Humanα-1-microglobulin/inter-α-trypsin inhibitor precursor amino acids227-283(1); 284-352(2)!; 23. TFI amino acids 47-117(1); 118-188(2);210-280(3)!. Gaps were included in 16, 17, 18 to achieve best alignment.Standard one letter codes for amino acids are used.

FIG. 7 shows the Northern blot analysis of RNAs from 3 liver-derivedcell lines. Ten μg of poly(A)⁺ RNA were used per lane. Lane 1, Changliver cell; lane 2, SK-HEP-1 hepatoma cell; lane 3, HepG2 hepatoma cell.

Standard biochemical nomenclature is used herein in which the nucleotidebases are designated as adenine (A); thymine (T); guanine (G); andcytosine (C). Corresponding nucleotides are, for example,deoxyguanosine-5'-triphosphate (dGTP). As is conventional forconvenience in the structural representation of a DNA nucleotidesequence, only one strand is shown in which A on one strand connotes Ton its complement and G connotes C. Amino acids are shown either bythree letter or one letter abbreviations as follows:

    ______________________________________                                        Abbreviated Designation    Amino Acid                                         ______________________________________                                        A         Ala              Alanine                                            C         Cys              Cysteine                                           D         Asp              Aspartic acid                                      E         Glu              Gluamic acid                                       F         Phe              Phenylalanine                                      G         Gly              Glycine                                            H         His              Histidine                                          I         Ile              Isoleucine                                         K         Lys              Lysine                                             L         Leu              Leucine                                            M         Met              Methionine                                         N         Asn              Asparagine                                         P         Pro              Proline                                            Q         Gln              Glutamine                                          R         Arg              Arginine                                           S         Ser              Serine                                             T         Thr              Threonine                                          V         Val              Valine                                             W         Trp              Tryptophan                                         Y         Tyr              Tyrosine                                           ______________________________________                                    

Commonly available restriction endonucleases described herein have thefollowing restriction sequences and (indicated by arrows) cleavagepatterns: ##STR2##

In order to illustrate specific preferred embodiments of the inventionin greater detail, the following exemplary laboratory preparative workwas carried out.

EXAMPLE 1 Materials

Human placental and fetal liver cDNA libraries were obtained fromClonetech. The protoblot immunoscreening kit was purchased from PromegaBiotech. Restriction enzymes were from New England Biolab,. Calfintestine alkaline phosphatase, T4 DNA ligase, DNA polymerase I(Klenow), exonuclease III and S1 nuclease were from Boehringer Mannheim.dNTPs were from P. L. Biochemicals. 5'- α-³⁵ S!-thio-dATP (600 Ci/mmol)was from Amersham. Sequencing kit (Sequenase) was from United StatesBiochemicals. Chang liver cells (ATCC CCL 13) and HepG2 hepatoma cells(ATCC HB 8065) were obtained from the American Type Culture Collection.SK-HEP-1 hepatoma cells were originally derived from a liveradenocarcinoma by G. Trempe of Sloan-Kettering Institute for CancerResearch in 1971 and are now widely and readily available.

125I-Factor X_(a) was prepared by radio-labeling using Iodo-gen. Thespecific activity was 2000 dpm/ng. Greater than 97% of radioactivity wasprecipitable with 10% trichloroacetic acid (TCA). The iodinated proteinretained >80% of their catalytic activity toward Spectrozyme X_(a)(American Diagnostica product).

An anti-TFI-Ig Sepharose® 4B column was prepared as follows: A peptide(called TFI-peptide) containing a sequence corresponding to the aminoacid sequence 3-25 of the mature TFI was synthesized using Biosystem'ssolid phase peptide synthesis system. The TFI-peptide (5 mg) wasconjugated to 10 mg of Keyhole lympet hemocyanin by glutaraldehyde. TwoNew Zealand white rabbits were each immunized by intradermal injectionwith a homogenate containing 1 ml of Freund complete adjuvant and 1 mlof conjugate (200 μg of TFI-peptide). One month later the rabbits wereeach boosted with a homogenate containing 1 ml of Freund incompleteadjuvant and 1 ml of conjugate (100 μg of conjugate). Antiserum wascollected each week for 3 months and booster injections were performedmonthly. To isolate specific antibody against TFI-peptide, the antiserumwas chromatographed on a TFI-peptide Sepharose 4B column. The column waswashed with 10 volumes of PBS (0.4M NaCl-0.1M benzamidine-1% Triton®X-100) and the same solution without Triton X-100. The antibody waseluted with 0.1M glycine/HCl, pH 2.2, immediately neutralized by adding1/10 volume of 1M Tris-OH and dialyzed against saline solution. Theisolated antibody was coupled to cyanogen bromide activated Sepharose 4Bby the manufacturer's (Pharmacia) method and used to isolate TFI fromthe cell culture medium.

Chang liver cell was cultured by the method described previously byBroze and Miletich, Proc. Natl. Acad. Sci. USA 84, 1886-1890 (1987). Theconditioned medium was chromatographed on the anti-TFI-Ig Sepharose 4Bcolumn. The column was washed with 10 volumes of PBS-1% Triton X-100 andPBS. The bound TFI was eluted with 0.1M glycine/HCl, pH 2.2. Theimmunoaffinity isolated TFI was further purified by preparative sodiumdodecylsulfate polyacrylamide gel electrophores is (Savant apparatus).Amino acid analysis of the final product showed the same amino terminalsequence as the TFI isolated from HepG2 cells as described in copendingapplication, Ser. No. 77,366, filed Jul. 23, 1987. The isolated Changliver TFI was then used to immunize rabbits by the immunization protocoldescribed above. The antiserum obtained had a titer of about 100 μg/mlin the double immunodiffusion test. This antiserum was used in theimmuno-screening of λgt11 cDNA libraries.

Methods

Isolation of cDNA clones. Methods for screening the placental and fetalliver cDNA libraries with antibody, plaque purification, and preparationof λ-phage lysate and DNA were as described by Wun and Kretzmer, FEBSLett. 1, 11-16 (1987). The antiserum was pre-adsorbed with BNN97 λgt11lysate and diluted 1/500 for screening the library.

Screening of factor X_(a) binding activity

Recombinant proteins induced by isopropyl-β-thiogalactoside fromimmuno-positive λ-phage isolates or from control λgt11 were screened forFactor X_(a) binding activity. The λ-phage lysates (0.1 ml) werefiltered through a nitrocellulose paper using a dot-blot apparatus (BioRad). The nitrocellulose paper was then immersed and agitated in aphosphate buffered saline containing 5 mg/ml bovine serum albumin and2.5 mg/ml bovine gamma globulin at room temperature for 1 h. Thesolution was replaced with 125I-Factor X_(a) (1.0×106 cmp/ml) dissolvedin the same solution supplemented with 0.1 mg/ml heparin and theagitation continued for another hour. The nitrocellulose paper was thenwashed with phosphate buffered saline containing 0.05% Tween® 20. Thewashing buffer was changed every 5 min., 4 times. The nitrocellulosepaper was then air-dryed and prepared for autoradiography using KodakXR5 film. The film was developed after 1 week exposure.

Preparation of poly(A)⁺ RNA and Northern blotting. Total RNAs wereprepared from cultured Chang liver cell, HepG2 hepatoma cell andSK-HEP-1 hepatoma cell using the sodium perchlorate extraction method ofLizardi, and Engelberg, Anal. Biochem. 98, 116-122, (1979). Poly(A)⁺RNAS were isolated by batch-wise adsorption on oligo(dT)-cellulose (P-LBiochemical, type 77F) using the procedure recommended by themanufacturer. For Northern blot analysis, 10 μg each of poly(A)⁺ RNA wastreated with glyoxal Thomas, Methods Enzymol. 100, 255-266 (1983)! andsubjected to agarose gel electrophoresis in a buffer containing 10 mMsodium phosphate, pH 7.0. Bethesda Research Laboratory's RNA ladder wasused as a molecular weight marker. The RNAs were transblotted onto anitrocellulose paper which was then baked at 80° for 2 h. The insert DNAof λP9 clone was radiolabeled with ³² p by nick translation and used asa probe Maniatis et al., Molecular Cloning: A Laboratory Model, ColdSpring Laboratory, Cold Spring Harbor, N.Y., (1982)!. The blot washybridized with 5×10⁶ cpm of the probe in 5 ml of a solution containing50% formamide, 5×SSC, 50 mM sodium phosphate, pH 7.0, 250 μg/mldenatured salmon sperm DNA, and 1× Denhardt's solution at 42° for 16 h.The filter was washed in 0.1% sodium dodecylsulfate (SDS), 2×SSC at roomtemperature 3 times, each time 5 min., and in 0.1% SDS, 0.2×SSC at 50°twice, each 5 min. The nitrocellulose paper was then air dried,autoradiographed for 3 days at -70° using Kodak XAR-5 film andintensifying screen.

Other recombinant DNA methods. Preparation of cloned λgt11 DNA,subcloning in pUC19 plasmid and M13mp18 vector, generation of deletionby exonuclease III digestion and DNA sequencing by dideoxy method Sangeret al., Proc. Natl. Acad. Sci. USA 83, 6776-6780 (1977)!, were performedas described by Wun and Kretzmer, supra.

The program FASTP written by Lipman and Pearson, Science 227 1435-1441(1985), was used to identify homologous families of proteins fromNational Biomedical Research Foundation Sequence Data Bank (release 13,June 1987) and to align the sequences within the homologous family.

RESULTS

Screening of cDNA libraries

A number of cell lines were screened for the presence of TFI in theconditioned media and it was found that several liver-derived celllines, Chang liver, HepG2 hepatoma, and SK-HEP-1 hepatoma secrete TFI inculture. Initially, an antiserum against TFI was used to screen a humanfetal liver λgt11 cDNA library (106 plaque forming units), and 15immunologically positive clones were obtained. Subsequently, the samemethod was used to screen a placental λgt11 cDNA library. Out of 10⁶plaque forming units, 10 immunologically positive clones were obtained.These clones were plaque purified and the lysates of the purified cloneswere tested for the functional activity of TFI. Theisopropylthiogalactoside induced phage lysates were absorbed on thenitrocellulose paper and screened for the ¹²⁵ I-Factor X_(a) bindingactivity. FIG. 1 demonstrates that some of these immunologicallypositive clones showed the ability to bind the ¹²⁵ I-Factor X_(a) on thenitrocellulose paper. In all, 3 out of 15 immunologically positive fetalliver clones, and 4 out of 10 immunologically positive placental clonesshowed ¹²⁵ I-Factor X_(a) binding activity. These immunologically andfunctionally positive clone were digested with EcoR1 and the size of theinserts were estimated by gel electrophoresis. One clone from placentallibrary (λP9) had an insert of approximately 1.4 kb, while all the otherclones contain inserts of approximately 1.0 kb. Partial DNA sequencinghas shown that 1.0 kb clones contain sequences identical to part of thelonger 1.4 kb placental clone (λP9). The λP9 was therefore selected forcomplete sequencing.

Nucleotide sequence and predicted Protein sequence of TFI cDNA isolate

The λP9 clone was subjected to restriction mapping, M13 subcloning andsequencing by the strategy shown in FIG. 2. The entire sequence wasdetermined on both strands by the exonuclease III deletion methodHenikoff, Gene 28, 351-359 (1984)! and found to consist of 1432 bases inlength. The sequence is shown in FIG. 3. It contains a 5'-noncodingregion of 133 bases, an open reading frame of 912 nucleotides, and a3'-noncoding region of 387 nucleotides. The first ATG occurs atnucleotide 134 in the sequence TAGATGA which was closely followed by asecond ATG at nucleotide 146 in the sequence ACAATGA. These are possiblythe initiation sequences, although they differ from the proposedconsensus sequence for initiation by eukaryotic ribosome, ACCATGG Kozak,Cell 44, 283-292 (1986)!. Twenty-eight amino acids precede a sequencecorresponding to the N-terminal of the mature protein. The length andcomposition of the hydrophobic segment of these 28 amino acids aretypical of signal sequences Von Heijne, Eur. J. Biochem. 133, 17-21(1983); J. Mol. Biol. 184, 99-105 (1985)!. A signal peptidase possiblycleaves at Ala₂₈ -Asp₂₉ to give rise to a mature protein. The sequencepredicted for mature TFI consists of 276 amino acids that contains 18cysteine residues and 7 methionines. The calculated mass of 31,950Daltons based on the deduced protein sequence for mature TFI is somewhatlower than the 37-40 kDa estimated by sodium dodecyl sulfatepolyacrylamide gel electrophoresis of isolated protein, and thedifference probably reflects the contribution of glycosylation to themobility of the natural protein. The deduced protein sequencecorresponding to the mature protein contains 3 potential N-linkedglycosylation sites with the sequence Asn-X-Thr/Ser (amino acidpositions 145, 195, and 256). Amino acid sequence analysis of purifiedwhole TFI and two isolated proteolytic fragments match exactly theprotein sequence deduced from cDNA sequence (FIG. 3, underlined),indicating the isolated cDNA clone encodes TFI. The 3'-noncoding regionis A+T rich (70% A+T). Neither consensus polyadenylation signal, AATAAAProudfoot and Brownlee, Nature 252, 359-362 (1981)! nor the poly A tailwas found in this clone, possibly due to artefactual loss of part of 3'terminal portion during construction of the library.

Charge distribution, hydrophobicity/hydrophilicity, and internalhomology

The translated amino acid sequence of the TFI contains 27 lysines, 17arginines, 11 aspartic acids, and 25 glutamic acids. The chargedistribution along the protein is highly uneven as shown in FIG. 4. Thesignal peptide region contains 2 positively charged lysine with 26neutral residues. The amino-terminal region of the mature proteincontains a highly negatively charged stretch. Six of the first 7residues are either aspartic acid or glutamic acid which are followedclosely by two more negatively charged amino acids downstream before apositively charged lysine residue is appears. The center portion of themolecule is generally negatively charged. At the carboxy terminal, thereis a highly positively charged segment. The amino acids 265 to 293 ofTFI contain 14 positively charged amino acids including a 6-consecutivearginine+lysine residues.

The predicted hydrophilicity/hydrophobicity profile of TFI protein isshown in FIG. 5. The signal peptide contains a highly hydrophobic regionas expected. The rest of the molecule appears rather hydrophilic.

The translated amino acid sequence of TFI contains several discernibledomains. Besides the highly negatively charged N-terminal domain and thehighly positively charged C-terminal domain, the center portion consistsof 3 homologous domains which have the typical sequences of theKunitz-type inhibitors (see below).

Homology to other proteins

By searching the National Biomedical Research Foundation sequence database, it was found that the N-terminal domain and C-terminal domain ofTFI do not show significant homology to other known proteins. The 3internal homologous domains, however, are each homologous to thesequences of other basic protease inhibitors including bovine pancreaticbasic protease inhibitor (aprotinin), venom basic protease inhibitors,and inter-α-trypsin inhibitors (FIG. 6). It is noteworthy that disulfidebonding structure is highly conserved in all these inhibitors. Based onthese homologies, it is clear that TFI belongs to the basic proteaseinhibitor gene superfamily.

Northern blotting

Poly(A)+ RNAs were purified from TFI-producing liver-derived cell lines,Chang liver, HepG2 hepatoma, and SK-HEP-1 hepatoma cells. The poly(A)+RNAs were resolved by denaturing agarose gel electrophoresis,transblotted onto a nitrocellulose paper and probed with ³² P-labeledTFI cDNA (λP9). As shown in FIG. 7, two major bands of hybridizationwere observed that corresponded to mRNAs of 1.4 kb and 4.4 kb in allthree cell lines tested. Several other cell lines were tested which donot produce detectable amounts of TFI and in which no hybridization withthe probe was found. (data not shown).

Various other examples will be apparent to the person skilled in the artafter reading the present disclosure without departing from the spiritand scope of the invention. It is intended that all such furtherexamples be included within the scope of the appended claims.

What is claimed is:
 1. A recombinant polypeptide comprising the aminoacid sequence of mature tissue factor inhibitor (TFI) as shown in FIG.3.
 2. A polypeptide as recited in claim 1 wherein said mature tissuefactor inhibitor consists of amino acids 29-304 as shown in FIG.
 3. 3.The polypeptide of claim 1 further comprising an additional amino acidextension imnmediately preceding said mature tissue factor inhibitor. 4.A polypeptide consisting of mature tissue factor inhibitor (29-304)immediately preceded by an N-terminal methionine residue, alanineresidue, or methionine-alanine di-peptide.
 5. A purified and isolatedpolypeptide comprising an amino acid sequence selected from the groupconsisting of:(a) Cys Ala Phe Lys Ala Asp Asp Gly Pro Cys Lys Ala IleMet Lys Arg Phe Phe Phe Asn Ile Phe Thr Arg Gln Cys Glu Glu Phe Ile TyrGly Ala Cys Glu Gly Asn Gln Asn Arg Phe Glu Ser Leu Glu Glu Cys Lys LysMet Cys; (b) Cys Phe Leu Glu Glu Asp Pro Gly Ile Cys Arg Gly Tyr Ile ThrArg Tyr Phe Tyr Asn Asn Gln Thr Lys Gln Cys Glu Arg Phe Lys Tyr Gly GlyCys Leu Gly Asn Met Asn Asn Phe Glu Thr Leu Glu Glu Cys Lys Asn Ile Cys;(c) Cys Leu Thr Pro Ala Asp Arg Gly Leu Cys Arg Ala Asn Glu Asn Arg PheTyr Tyr Asn Ser Val Ile Gly Lys Cys Arg Pro Phe Lys Tyr Ser Gly Cys GlyGly Asn Glu Asn Asn Phe Thr Ser Lys Gln Glu Cys Leu Arg Ala Cys; (d) LeuLys Leu Met His Ser Phe Cys Ala Phe Lys Ala Asp Asp Gly Pro Cys Lys AlaIle Met Lys Arg Phe Phe Phe Asn Ile Phe Thr Arg Gln Cys Glu Glu Phe IleTyr Gly Ala Cys Glu Gly Asn Gln Asn Arg Phe Glu Ser Leu Glu Glu Cys LysLys Met Cys Thr Arg Asp Asn Ala Asn Arg Ile Ile Lys Thr Thr Leu; (e) GlnGln Glu Lys Pro Asp Phe Cys Phe Leu Glu Glu Asp Pro Gly Ile Cys Arg GlyTyr Ile Thr Arg Tyr Phe Tyr Asn Asn Gln Thr Lys Gln Cys Glu Arg Phe LysTyr Gly Gly Cys Leu Gly Asn Met Asn Asn Phe Glu Thr Leu Glu Glu Cys LysAsn Ile Cys Glu Asp Gly Pro Asn Gly Phe Gln Val Asp Asn Tyr Gly; and (f)Glu Phe His Gly Pro Ser Trp Cys Leu Thr Pro Ala Asp Arg Gly Leu Cys ArgAla Asn Glu Asn Arg Phe Tyr Tyr Asn Ser Val Ile Gly Lys Cys Arg Pro PheLys Tyr Ser Gly Cys Gly Gly Asn Glu Asn Asn Phe Thr Ser Lys Gln Glu CysLeu Arg Ala Cys Lys Lys Gly Phe Ile Gln Arg Ile Ser Lys Gly Gly Leu. 6.A purified and isolated polypeptide having an amino acid sequenceselected from the group consisting of:(a) Cys Ala Phe Lys Ala Asp AspGly Pro Cys Lys Ala Ile Met Lys Arg Phe Phe Phe Asn Ile Phe Thr Arg GlnCys Glu Glu Phe Ile Tyr Gly Ala Cys Glu Gly Asn Gln Asn Arg Phe Glu SerLeu Glu Glu Cys Lys Lys Met Cys; (b) Cys Phe Leu Glu Glu Asp Pro Gly IleCys Arg Gly Tyr Ile Thr Arg Tyr Phe Tyr Asn Asn Gln Thr Lys Gln Cys GluArg Phe Lys Tyr Gly Gly Cys Leu Gly Asn Met Asn Asn Phe Glu Thr Leu GluGlu Cys Lys Asn Ile Cys; and (c) Cys Leu Thr Pro Ala Asp Arg Gly Leu CysArg Ala Asn Glu Asn Arg Phe Tyr Tyr Asn Ser Val Ile Gly Lys Cys Arg ProPhe Lys Tyr Ser Gly Cys Gly Gly Asn Glu Asn Asn Phe Thr Ser Lys Gln GluCys Leu Arg Ala Cys.
 7. A recombinant polypeptide having the amino acidsequence of:Asp Ser Glu Glu Asp Glu Glu His Thr Ile Ile Thr Asp Thr GluLeu Pro Pro Leu Lys Leu Met His Ser Phe Cys Ala Phe Lys Ala Asp Asp GlyPro Cys Lys Ala Ile Met Lys Arg Phe Phe Phe Asn Ile Phe Thr Arg Gln CysGlu Glu Phe Ile Tyr Gly Ala Cys Glu Gly Asn Gln Asn Arg Phe Glu Ser LeyGlu Glu Cys Lys Lys Met Cys Thr Arg Asp Asn Ala Asn Arg Ile Ile Lys ThrThr Leu Gln Gln Glu Lys Pro Asp Phe Cys Phe Leu Glu Glu Asp Pro Gly IleCys Arg Gly Tyr Ile Thr Arg Tyr Phe Tyr Asn Asn Gln Thr Lys Gln Cys GluArg Phe Lys Tyr Gly Gly Cys Leu Gly Asn Met Asn Asn Phe Glu Thr Leu GluGlu Cys Lys Asn Ile Cys Glu Asp Gly Pro Asn Gly Phe Gln Val Asp Asn TyrGly Thr Gln Leu Asn Ala Val Asn Asn Ser Leu Thr Pro Gln Ser Thr Lys ValPro Ser Leu Phe Glu Phe His Gly Pro Ser Trp Cys Leu Thr Pro Ala Asp ArgGly Leu Cys Arg Ala Asn Glu Asn Arg Phe Tyr Tyr Asn Ser Val Ile Gly LysCys Arg Pro Phe Lys Tyr Ser Gly Cys Gly Gly Asn Glu Asn Asn Phe Thr SerLys Gln Glu Cys Leu Arg Ala Cys Lys Lys Gly Phe Ile Gln Arg Ile Ser LysGly Gly Leu Ile Lys Thr Lys Arg Lys Arg Lys Lys Gln Arg Val Lys Ile AlaTyr Glu Glu Ile Phe Val Lys Asn Met.
 8. A purified and isolatedpolypeptide having the amino acid sequence of:Asp Ser Glu Glu Asp GluGlu His Thr Ile Ile Thr Asp Thr Glu Leu Pro Pro Leu Lys Leu Met His SerPhe Cys Ala Phe Lys Ala Asp Asp Gly Pro Cys Lys Ala Ile Met Lys Arg PhePhe Phe Asn Ile Phe Thr Arg Gln Cys Glu Glu Phe Ile Tyr Gly Ala Cys GluGly Asn Gln Asn Arg Phe Glu Ser Leu Glu Glu Cys Lys Lys Met Cys Thr ArgAsp Asn Ala Asn Arg Ile Ile Lys Thr Thr Leu Gln Glu Glu Lys Pro Asp PheCys Phe Leu Glu Glu Asp Pro Gly Ile Cys Arg Gly Tyr Ile Thr Arg Tyr PheTyr Asn Asn Gln Thr Lys Gln Cys Glu Arg Phe Lys Tyr Gly Gly Cys Leu GlyAsn Met Asn Asn Phe Glu Thr Leu Glu Glu Cys Lys Asn Ile Cys Glu Asp GlyPro Asn Gly Phe Gln Val Asp Asn Tyr Gly Thr Gln Leu Asn Ala Val Asn AsnSer Leu Thr Pro Gln Ser Thr Lys Val Pro Ser Leu Phe Glu Phe His Gly ProSer Trp Cys Leu Thr Pro Ala Asp Arg Gly Leu Cys Arg Ala Asn Glu Asn ArgPhe Tyr Tyr Asn Ser Val Ile Gly Lys Cys Arg Pro Phe Lys Tyr Ser Gly CysGly Gly Asn Glu Asn Asn Phe Thr Ser Lys Gln Glu Cys Leu Arg Ala Cys LysLys Gly Phe Ile Gln Arg Ile Ser Lys Gly Gly Leu Ile Lys Thr Lys Arg LysArg Lys Lys Gln Arg Val Lys Ile Ala Tyr Glu Glu Ile Phe Val Lys Asn Met.